Worst-case optimal algorithm for XPath evaluation over XML streams

نویسنده

  • Prakash Ramanan
چکیده

We consider the XPath evaluation problem: Evaluate an XPath query Q on a streaming XML documentD; i.e., determine the setQ(D) of document elements selected byQ. We mainly consider Conjunctive XPath queries that involve only the child and descendant axes. Previously known in-memory algorithms for this problem use O(|D|) space and O(|Q||D|) time. Several previously known algorithms for the streaming version use Ω(d) space and Ω(d|D|) time in the worst case; d denotes the depth of D, and n denotes the number of location steps in Q. Their exponential space requirement could well exceed the O(|D|) space used by the in-memory algorithms. We present an efficient algorithm that uses O(d|Q| + nc) space and O((|Q| + dn)|D|) time in the worst case; c denotes the maximum number of elements ofD that can be candidates for output, at any one instant. For some worst case Q and D, the memory space used by our algorithm matches our lower bound proved in a different paper; so, our algorithm uses optimal memory space in the worst case. keywords. XML, XPath, query evaluation, stream processing

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Memory lower bounds for XPath evaluation over XML streams

We consider the XPath evaluation problem: Evaluate an XPath query Q on a streaming XML document D. We consider two versions of the problem: 1). Filtering Problem: Determine if there is a match for Q in D. 2). Node Selection Problem: Determine the set Q(D) of document nodes selected by Q. We consider Conjunctive XPath (CXPath) queries that involve only the child and descendant axes. Let d denote...

متن کامل

Online Dictionary Matching for Streams of XML Documents

We consider the online multiple-pattern matching problem for streams of XML documents, when the patterns are expressed as linear XPath expressions containing child operators (/), descendant operators (//) and wildcards (∗) but no predicates. For each document in the stream, the task is to determine all occurrences in the document of all the patterns. We present a general multiple-pattern-matchi...

متن کامل

QuickXScan: Efficient Streaming XPath Evaluation

Many XML applications over the Internet favor high-performance single-pass streaming XPath evaluation. Finite automata-based algorithms suffer from potentially combinatorial explosion of dynamic states for matching descendant axes. We present QuickXScan for streaming evaluation of XPath queries containing child and descendant axes with complex predicates. Using a tree representation for an XPat...

متن کامل

Evaluating an XPath Query on a Streaming XML Document

We present an efficient algorithm for evaluating an XPath query Q (involving only child and descendant axes) on a streaming XML document D. Previously known in-memory algorithms for XPath evaluation use O(|D|) space and O(|Q||D|) time. Several previous algorithms for the streaming version use Θ(d +c) space and Θ(d|D|) time in the worst case; d is the depth of D, n is the number of location step...

متن کامل

An Evaluation Study of Search Algorithms for XML Streams

Keyword-based searching services over XML streams are essential for widely used streaming applications, such as dissemination services, sensor networks and stock market quotes. However, XML stream keyword search algorithms are usually schema dependent and do not allow pure keyword queries. Furthermore, ranking methods are still relatively unexploited in such algorithms. This paper presents an a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • J. Comput. Syst. Sci.

دوره 75  شماره 

صفحات  -

تاریخ انتشار 2009